Neural Network Based Nonlinear Discriminant Analysis for Speech Recognition
نویسندگان
چکیده
Neural networks have been one of the most successful recognition models for automatic speech recognition systems because of their high discriminative power and adaptive learning. In many speech recognition tasks, especially for discrete speech classification, it has been shown that neural networks are very powerful for classifying short-time acoustic-phonetic units, such as individual phonemes. Moreover, neural networks have a strong ability for dimensionality reduction. In contrast to many linear dimensionality reduction techniques including Principal Components Analysis (PCA) and Linear Discriminant Analysis (LDA), neural network based nonlinear reduction approaches are able to form a dimensionally-reduced representation for complex data such as speech features, while preserving variability and discriminability of the original data. In this paper, a neural network is combined with Hidden Markov Models (HMMs) for a continuous phonetic speech recognition system, in which the neural network is trained with phonetic labeling information as a classifier to maximize discrimination among speech features for the speech recognition based on HMMs. Additionally, the dimensionality of speech features is reduced by the neural network with the goal of creating a compact set of highly discriminative features for accurate speech recognition. Experimental evaluation using the TIMIT database shows that the combination of neural networks and HMMs is quite effective for improving recognition accuracy.
منابع مشابه
Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملNeural networks for nonlinear discriminant analysis in continuous speech recognition
In this paper neural networks for Nonlinear Discrimi nant Analysis in continuous speech recognition are pre sented Multilayer Perceptrons are used to estimate a posteriori probabilities for Hidden Markov Model states which are the optimal discriminant features for the sepa ration of the HMM states The a posteriori probabilities are transformed by a principal component analysis to calcu late the...
متن کاملبهبود عملکرد سیستم بازشناسی گفتار پیوسته بوسیله ویژگیهای استخراج شده از مانیفولدهای گفتاری در فضای بازسازی شده فاز
The design for new feature extraction methods out of the speech signal and combination of their obtained information is one of the most effective approaches to improve the performance of automatic speech recognition (ASR) system. Recent researches have been shown that the speech signal contains nonlinear and chaotic properties, but the effects of these properties are not used in the continuous ...
متن کاملشبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملNonlinear Discriminant Analysis Based Feature Dimensionality Reduction for Automatic Speech Recognition By
All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. Abstract Automatic Speech Recognition (ASR) has advanced to the point w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009